A Note on Sequence Prediction over Large Alphabets
نویسنده
چکیده
Building on results from data compression, we prove nearly tight bounds on how well sequences of length n can be predicted in terms of the size σ of the alphabet and the length k of the context considered when making predictions. We compare the performance achievable by an adaptive predictor with no advance knowledge of the sequence, to the performance achievable by the optimal static predictor using a table listing the frequency of each (k + 1)-tuple in the sequence. We show that, if the elements of the sequence are chosen uniformly at random, then an adaptive predictor can compete in the expected case if k ≤ logσ n− 3− , for a constant > 0, but not if k ≥ logσ n.
منابع مشابه
Estimation of Hydrodynamic Force on Rough Circular Cylinders in Random Waves and Currents (RESEARCH NOTE)
Most of the Codes of Practice (API, BSI, DnV, NPD) uses Morison's equation to estimate hydrodynamic loads on fixed and moving offshore structures. The significant difference in the prediction of the loads mainly arises from the assumption of the values of hydrodynamic coefficients. In this paper by analysing a full scale set of data in large KC's numbers collected from Delta Wave Flume in the N...
متن کاملA Note on the Strong Law of Large Numbers
Petrov (1996) proved the connection between general moment conditions and the applicability of the strong law of large numbers to a sequence of pairwise independent and identically distributed random variables. This note examines this connection to a sequence of pairwise negative quadrant dependent (NQD) and identically distributed random variables. As a consequence of the main theorem ...
متن کاملDesigning succinct structural alphabets
MOTIVATION The 3D structure of a protein sequence can be assembled from the substructures corresponding to small segments of this sequence. For each small sequence segment, there are only a few more likely substructures. We call them the 'structural alphabet' for this segment. Classical approaches such as ROSETTA used sequence profile and secondary structure information, to predict structural f...
متن کاملMinimizing distortion caused by welding, by sequencing optimization in a large steel panel
Increasingly, Welding is used in industry for assembled various products, such as ships, cars, trains and bridges. Welding distortion often results such as lack of accuracy during assembly and will have increases manufacturing costs. So, predict and reduce welding distortion is very important to improve the quality of welded structures. In this study, firstly, a prediction method of welding di...
متن کاملMinimizing distortion caused by welding, by sequencing optimization in a large steel panel
Increasingly, Welding is used in industry for assembled various products, such as ships, cars, trains and bridges. Welding distortion often results such as lack of accuracy during assembly and will have increases manufacturing costs. So, predict and reduce welding distortion is very important to improve the quality of welded structures. In this study, firstly, a prediction method of welding di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Algorithms
دوره 5 شماره
صفحات -
تاریخ انتشار 2012